Automated indexing for making of a newspaper article database.
نویسندگان
چکیده
منابع مشابه
Automatic Indexing of Newspaper Microfilm Images
This paper describes a proposed document analysis system that aims at automatic indexing of digitized images of old newspaper microfilms. This is done by extracting news headlines from microfilm images. The headlines are then converted to machine readable text by OCR to serve as indices to the respective news articles. A major challenge to us is the poor image quality of the microfilm as most i...
متن کاملLinking article parts for the creation of newspaper digital library
An important issue pertaining to the retro-conversion of newspapers, i.e. the conversion of newspaper issues into digital resources, is the identification and appropriate digital representation of an article. To complete this task, a number of steps have to be followed, from segmentation of the newspaper image to optical character recognition and linking of different items belonging to the same...
متن کاملTextual Article Clustering in Newspaper Pages
In the analysis of a newspaper page an important step is the clustering of various text blocks into logical units, i.e., into articles. We propose three algorithms based on text processing techniques to cluster articles in newspaper pages. Based on the complexity of the three algorithms and experiment on actual pages from the Italian newspaper L’Adige, we select one of the algorithms as the pre...
متن کاملExploitation of Newspaper-article Characteristics for Article Retrieval and Answer Extraction in QAC Task 2
In this paper, we discuss a system for newspaper article retrieval and answer extraction. Due to the rapidly increasing amount of accessible information, systems that allow search in natural language are expected to play a much more important role in the very near future. Our system, called RAIK-Prassie, is designed for TASK 2 of QAC. The design of the RAIK-Prassie system focuses mainly on prac...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Journal of Information Processing and Management
سال: 1989
ISSN: 0021-7298,1347-1597
DOI: 10.1241/johokanri.32.283